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Abstract. Let T : f2 — > Qnxn ^ matrix- valued function that is analytic on some simply- 
connected domain H C C A point A £ f! is a (nonlinear) eigenvalue if T(A) is singular. In this paper, 
we describe new localization results for nonlinear eigenvalues, generalizing Gershgorin's theorem, 
pseudospectral inclusion theorems, and the Bauer-Fike theorem. We use our results to analyze 
three nonlinear eigenvalue problems: an example from delay differential equations, a problem due to 
Hadeler, and a quantum resonance computation. 
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1. Introduction. In this paper, we study the nonhnear eigenvalue problem 

T(A)w = 0, v^Q, (1.1) 
where T : — ^ £ny.n jg a^aiy^jc on a simply-connected domain $7 C C. Prob- 



lem (1.1 1 occurs in many applications [4||22], often from applying transform methods 
to analyze differential and difference equations. The best-studied nonlinear eigenvalue 
problems are those for which T is polynomial in A [TT], and particularly those that 
are quadratic in A [ISl [32] ■ More general nonlinear eigenvalue problems that involve 
algebraic or transcendental matrix functions are prevalent in models with delay [24] 
or radiation [28 l l35 l l36] . 

In this paper, we consider localization results that define regions in which any 
eigenvalues must lie. Localization regions such as pseudospectra |32j and Gershgorin 
disks |33| are widely used in the analysis of ordinary eigenvalue problems. In error 
analysis, localization results bound how much numerically computed eigenvalues are 
affected by roundoff and other approximations, particularly when the approximation 
error is not tiny or the eigenvalues in question are ill-conditioned. Localization regions 
that are crude but easy to compute are used in linear stability of dynamical systems, 
as an easy way to see that a matrix has no eigenvalues in the right half plane or outside 
the unit disk. Crude localization results are also used to find good shifts for spectral 
transformations commonly used with iterative eigensolvers. But though localization 
is as useful for nonlinear eigenvalues as for linear eigenvalue problems, little has been 
done to adapt standard localization results to the nonlinear case. 

Several researchers have explored perturbation theory for polynomial eigenvalue 
problems [U El HZl UHl HI] , but this work has limited applicability outside the polyno- 
mial case. Similarly, the research into nonlinear pseudospectra |7l El HU [23l [24l |34] has 
primarily focused on specific types of eigenvalue problems, such as polynomial prob- 
lems or problems arising from delay differential equations. Apart from some work 
in the context of delay differential equations [TH] , we are not aware of any efforts to 
extend localization results such as Gershgorin's theorem or the Bauer-Fike theorem 
to the nonlinear case. The key contribution of this paper is the adaptation of these 
and other such results to general nonlinear eigenvalue problems. 
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The rest of this paper is organized as follows. In Section [2j we recall some useful 
standard results from the theory of analytic matrix-valued functions. In Section [3j 
we describe a generalized Gershgorin theorem for nonlinear eigenvalue problems, and 
in Section [4] we introduce and discuss nonlinear pseudospectra. We then turn to the 
useful special case of linear functions with nonlinear perturbations in Section [Sj where 
we describe analogues of Gershgorin's theorem and the Bauer-Fike theorem for this 
case. We illustrate the usefulness of our bounds through some examples in Section [6] 
and conclude in Section [71 

2. Background. We assume throughout this paper that J7 C C is a simply- 
connected domain and T : J7 — )■ C"^" is analytic and regular, i.e. det(T(2:)) ^ 0. For 
T regular, the zeros of det(r(z)) are a discrete set with no accumulation points in 
n. We call A e f2 a nonlinear eigenvalue with multiplicity m if det(T(z)) has a zero 
of order m at A. As with standard eigenvalue problems, when we count eigenvalues 
in a region, we always count multiplicity. The set of all nonlinear eigenvalues is the 
nonlinear spectrum A(r). 

Because they are zeros of analytic functions, nonlinear eigenvalues move continu- 
ously under analytic perturbation: if A G A(T) has multiphcity m and E' : — > C"^" 
is analytic, then, for small enough e > 0, there is some (5 > so that if |s| < (5 then 
the ball of radius e about A contains exactly m eigenvalues of T -I- sE. The proof 
depends on Rouche's theorem [H]: 

Theorem 2.1 (Rouche). Suppose f and g are analytic on and inside a simple 
closed contour F and \ f{z)\ > \g{z)\ for each z e F. Then f and f + g have the same 
number of zeros inside T, counting multiplicities. 

Proof. Let / : C — > C be nonzero and analytic on a simple, positively-oriented, 
closed contour F and mcromorphic inside F. The winding number for / about F is 

1 r f'(z) 

Wrif) / dz = # zeros - # poles of / inside F. 

27ri Jr f(z) 

If \f{z)\ > \g{z)\ for any z e F, then for any s £ [0, 1], 

|/(z) + sg{z)\ > \fiz)\ - s\giz)\ > |/(z)| - \giz)\ > 0. 

Because Wr{f + sg) is integer- valued and continuously defined for < s < 1, it must 
be constant. So Wrif + 9) = WAf)- □ 

Rouche's theorem has two ingredients: a count based on winding numbers, and 
a condition to establish that the winding number does not change along a homotopy. 
This recipe extends to other homotopies and other conditions that guarantee continu- 
ity of the winding number along the path. For example, we can apply this argument 
to count nonlinear eigenvalues inside a contour: 

Theorem 2.2. Suppose T : fl C"^" and E : ^ C"^" are analytic, and that 
T d is a simple closed contour. IfT(z) + sE{z) is nonsingular for all s G [0, 1] and 
all z € F, then T and T + E have the same number of eigenvalues inside T, counting 
multiplicity. 

Proof. Define /(z; s) = det(r(z) + sE(z)). The winding number of /(z; s) around 
F is the number of nonlinear eigenvalues of T + sE inside F. For z S F and s € [0, 1], 
by hypothesis, T{z) + sE{z) is nonsingular, and so /(z; s) 7^ 0. Hence, the winding 
number is continuously defined (and thus constant) for s G [0, 1]. □ 

Note that 



log detT(z)=tr(T(z)- iT'(z)), 
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and so the winding number in Theorem 2.2 is tr(Pr)j where 

In the special case where T{z) — A — zB, Pp is the spectral projector onto a right 
deflating subspace associated with eigenvalues inside F; see, e.g. [151 §VI.2.4]. More 
generally, if a contour F encloses only simple eigenvalues, as is generically the case, 
then Pp is a sum of right spectral projectors for the zero eigenvalues of each of the 
pencils (r(A),r'(A)) for A e A(T) inside F. 

A corollary to Theorem |2.2| is the matrix version of an operator Rouche's theorem 
due to Gohberg and Sigal [l2] : 

Theorem 2.3. Suppose T : n ^ C"^" and E : il C"^" are analytic and 
T CZ ft is a simple closed contour. If ||T'(z)^^£'(z)|| < 1 for all z G F, then T and 
T + E have the same number of eigenvalues inside F, counting multiplicity. 

Proof. Write 

T(z) + sE{z) = T{z) (/ + sT{z)-^E{z)) . 
The hypothesis guarantees the Neumann series 

oo 

(/ + sT{zr'E{z)y' = ^(-.r(z)-ii?(z))^- 

converges for s G [0, 1], so T{z) + sE{z) is nonsingular for all s e [0, 1] and z € F. By 
Theorem |2.2[ T and T + E have the same number of eigenvalues inside F. □ 



3. Nonlinear Gershgorin bounds. Theorem [2?2] provides a template for con- 
structing inclusion regions to compare the nonlinear spectra of two related problems. 
The following is a nonlinear generalization of Gershgorin's theorem that allows us to 
compare the nonlinear spectrum of a general matrix- valued function to the zeros of a 
list of simpler, scalar- valued functions. 

Theorem 3.1. Suppose T{z) — D{z) + E{z) where D,E : fl £nxn analytic 
and D is diagonal. Then for any < a < 1, 

n 

A{T) c U G,-„, 

where Gj^a *s the jth generalized Gershgorin region 

G^ = {zen:\d,,{z)\<r,{zrc,izy"^} 
and Tj and Cj are the jth absolute row and column sums of E, i.e. 

n n 
A;=l i=l 

Moreover, suppose that U is a bounded connected component of the union [Jj such 
that U d Q. Then U contains the same number of eigenvalues of T and D; and iflA 
intersects m Gershgorin regions, it must contain at least m eigenvalues. 
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Proof. If z e is not in IJ^ G", then, for each j, 

\dn\>rfc]~'^^{f, + \e,,\r(c, + \e,,\y 

where 



Sjk\, Cj 



El 



are the deleted absolute row and column sums of E. Applying Holder's inequality 
with p = 1/a and q = 1/(1 — a), we have 



and by the triangle inequality, 



Therefore, for each j, 



l-a 



and so by a nonsingularity test of Ostrowski [33^ Theorem 1.16], T{z) is nonsingular. 
The same argument shows that D + sE is nonsingular for any < s < 1. 

Because D + sE is nonsingular outside the Gershgorin regions. Theorem [2[2] im- 
plies that any closed contour in ft that does not pass through IJ^ G" contains the 
same number of eigenvalues from A(r) and A(L)), counting multiplicity. Thus, if U 
is a bounded connected component of IJ^- G", U C ft, then D and T must have the 
same number of eigenvalues inside U. For each Gershgorin region that intersects 
D, and thus T, has at least one eigenvalue inU. D 

The usual statement of Gershgorin's theorem corresponds to the special case when 
T(z) = A— zl = {D — zl) + E, where D is the diagonal part of A and a is set to zero 
or one. Then the Gershgorin regions are simply disks, and a component consisting 



of m disks contains ra eigenvalues. However, Theorem 3.1 involves some hypotheses 
that are not needed for the standard version of Gershgorin's theorem. We illustrate 
the role of these hypotheses through three examples. 
Example 1. For the matrix 



T{z) 



we have Gershgorin regions G\ = {0} and G\ — {z : \z\ > 1}. The first region 
contains the sole finite eigenvalue for the problem. The second region contains no 
finite eigenvalues, though there is an eigenvalue at infinity. This does not violate the 



counting result in Theorem 3.1 since the second region is unbounded. 
Example 2. Consider the matrix 



T{z) 



1 





0.5 
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Fig. 3.1. Gershgorin regions in Example [g| (left) and Example \3\ (right). In Example 2, the 
second Gershgorin region consists of two pieces, and the union of G\ and contains three eigen- 
values. In Example 3, neither Gershgorin region contains eigenvalues; but both G\ and Gj intersect 
(—00,0], which is not in the domain O for this problem. 



The Gershgorin regions are shown in Figure 3.1 (left). For this problem, G\ is the 
closed unit disk, G\ consists of two roughly circular components around ±1, and G\ 
is empty. The region U = G\\J G\ intersects two Gershgorin regions, and contains 
3 > 2 eigenvalues. Unlike Gershgorin disks in the standard problem, each bounded 
Gershgorin region may contain one or many eigenvalues. 
Example 3. Consider the matrix 



T(z) 



0.2v^+l -1 
0.4VZ 1 



defined on f2 = C — (— oo,0], where ^/z is taken to be the principal branch of the 
square root function. If we let D{z) be the diagonal of T{z), the Gershgorin regions 
are as shown in Figure [O] (right) . Note that 



det(i:>(z)) = z-0.2V^ + 1 = (\/^-0.1 -i%/a99)(V^- 0.1 + i%/a99) 
has two solutions on the primary sheet of the square root function, but 

det(T(z)) = z + 0.2V^ + 1 = (V^ + 0.1 - i%/a99)(yi + 0.1 + iVoM) 
only has solutions on the second sheet of definition. Thus, the set G\ contains two 



eigenvalues of D{z), but no eigenvalues of T(z). This does not violate Theorem 3.1 
because the closed set includes [—1,0] ^ $7. 

4. Pseudospectral regions. The spectrum of a matrix A is the complement of 
the resolvent set, i.e., the set of z such that the resolvent operator R{z) = {zl — A)^^ 
is well-defined. The e-pseudospectrum of A is equivalently defined as 

A,(A)^{z:||i?(z)||>6-i} (4.1) 
^ U A{A + E), (4.2) 

ll-E||<e 

with the convention that ||i?(A)|| = oo when A e A(j4). 
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Several authors have worked on nonhnear generaUzations of pseudospectra [71 [9l 



[221 [Ml EH ■ The usual definitions of nonlinear pseudospectra generalize (4.2). Let 
be a space consisting of some set of analytic matrix-valued functions of interest; 
then the e-pseudospectrum for T e is 

Ae(T)= U A{T + E). (4.3) 

Be-7^.ll-B||giob<e 

where ||i?||giob is a global measure of the size of the perturbing function E. For 
polynomial eigenvalue problems and nonlinear eigenvalue problems from the analysis 



of delay differential equations, many authors use the definition (4.3) with 



^ = ^Jl'^^P^W : A. e C"><"| , IMIgiob = function of Aq, Ai, . . . , A„ (4.4) 

where the functions Pi{X) are fixed entire functions 24, Chapter 2]. However, we wish 
to use our results to compare nonlinear eigenvalue problems with different types of 
dependencies on A; for example, we want to compare problems with transcendental 
dependence on A to approximations that have polynomial or rational dependence on 
A. For this purpose, there may not be a natural formulation in terms of a standard 
set of coefficient functions. 

We take J" to be the space of all analytic matrix-valued functions C"(r2,C"^"), 
and measure size with 

||ii;||g,ob = sup||i?(z)|l. (4.5) 



Using the general definition (4.3 ) with the size measure (4.5 ), we have three equivalent 
expressions for the nonlinear pseudospectra, similar to the equivalent definitions for 
ordinary pseudospectra; see [321 Theorem 2.1]. 

Theorem 4.1. Let £ = {E : n ^ C"^" s.t. E analytic, sup^^^\\E{z)\\ < e} 
and So — {Eq e i^^nxn . < £|_ Then the following definitions are equivalent: 

A,{T) ^ {z e n : \\R.{z)\\ > e-^} (4.6) 

= IJ A(r + £;) (4.7) 

Ee£ 

- U AiT + Eo). (4.8) 



Proof Denote the sets in (|46|, (|4j|, and ([48| as Al{T), Al{T), and Al{T). We 
break the proof into three steps: 

z e Al{T) <^ z € A3(r): If T{z) + E{z) is singular for some S e £, 
then T{z) -f Eq is singular for E^ = E{z). Since Eo e So, it follows that z € A^iT). 
Conversely, if T{z) + Eo is singular for some Eo G Soi then T{z) + E{z) is singular for 
E the constant function Eq. 

z ^ Al{T) =^ z ^ A3(T): Suppose \\T{z)-^\\ < e^i. Then for any Eo such 
that ||i?o|| < e, we have that ||r(2:)~^i5o|| < Ij so there is a convergent Neumann series 
for / + T{z)-^Eo. Thus, {T{z) + Eo)'^ = (I + T{z)-'^Eoy^T{z)-^ is weU defined. 

z € Aj(T) z e Af(r): Eigenvalues of T belong to both sets, so we 

need only consider z G A^(r) not an eigenvalue. So suppose T{z) is invertible and 
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s^^ — ||r(z)^"'^|| > e^^. Then T{z)^^u = s^^v for some vectors u and v with unit 
norm; alternately, write su = T{z)v. Let Eq — ~suw* , where w* is a dual vector of 
V. Then ||i?o|| = s < e, and T{z) + is singular with u as a null vector. □ 

The e-pseudospectra clearly contains the ordinary spectrum, but we can say more. 
The following theorem is nearly identical to the analogous statement for ordinary 
pseudospectra [32l Theorem 4.2]: 

Theorem 4.2. Suppose T : il — > C"^" is analytic andlA is a hounded connected 
component of A^(T) such thatU C fl. ThenlA contains an eigenvalue ofT. 

Proof. If T{z)~^ were analytic on U, \\T{z)~^\\ would be subharmonic on U, and 
the maximum value of ||T(z)~"'^|| would have to lie on dU. But ||r(z)^^|| = e for 
z e dU, and ||T(2)^^|| > e ioi z eU. Therefore, T{z)^^ cannot be analytic on U, i.e. 
there is an eigenvalue inU. U 

A useful feature of pseudospectra is the connection with backward error, and this 
carries over to the nonlinear case: 

Theorem 4.3. Suppose T{X)x = r and ||r||/||a;|| < e. Then A G AeiT). 

Proof Define E ^ -pp- Then (T(A) + E)x = and \\E\\ = ||r||/||a;|| < e. □ 

We can also compare eigenvalue problems via pseudospectra. As discussed in the 
next section, this is particularly useful in the case when one of the problems is linear. 

Theorem 4.4. Suppose T : ^ C"^" and : ^ C"^" are analytic, and let 
n, = {zen: \\E{z)\\ < e}. 

Then 

A{T + E)nn, c Ae(T)na. 

Furthermore, ifU is a bounded connected component of A^{T) such thatlA C ile, then 
lA contains exactly the same number of eigenvalues ofT and T + E. 

Proof. The inclusion result is obvious based on the characterization of the non- 
linear pseudospectra as unions of spectra of perturbations to T. The counting result 
follows from the continuity of nonlinear eigenvalues: the set Ag(T) n lie contains 
A(r + sE) n for all < s < 1, so for each eigenvalue of T + £' m U, there is a 
continuously-defined path to a corresponding eigenvalue T that remains inU. D 

5. Nonlinear perturbations of linear eigenvalue problems. A lineariza- 
tion of a matrix polynomial P : C ^ C"^" is a pair {A, B) e C^"'^)''^"'^) such that the 
polynomial P and the pencil (v4, B) have the same spectrum. There are many possi- 
ble linearizations, and significant effort has gone into characterizing linearizations and 
their structural properties [201 HI] • More recent work addresses similar linearizations 
for rational eigenvalue problems [27j . One way to find the spectrum of a nonlinear 
matrix function T is to approximate T by some rational or polynomial function T, 
then find eigenvalues of T through a linearization. In this case, the spectrum of T 
can be analyzed as a nonlinear perturbation of a linearization of T. 

We follow a simple strategy to generalize standard perturbation theorems for 
linear eigenvalue problems to the case where the perturbations are nonlinear. Let 
T : f7 ^ C"""' have the form 

T{z) = A - zB + E{z), 

and suppose we can bound E, either in norm or in the magnitude of individual 
components, over a domain fJ^; C 51. We then apply perturbation theorems from the 
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linear case that are valid for any fixed perturbation which is similarly controlled. This 
argument gives us a set that includes all eigenvalues of T inside ^Ie- By continuity of 
the eigenvalues, if is a bounded connected component such that U C ^Ie, then U 
contains the same number of eigenvalues of T as of the linear pencil A — zB. 

Perhaps the simplest bound of this sort involves the pseudospectra of the gener- 
alized eigenvalue problem: 

Theorem 5.1. Suppose : — > C"^" is analytic, and let 

a = {2 e r2 : ||£^(z)|| < e}. 
Suppose also that [A, B) is a regular pencil. Then for T = A — zB + E{z), 

A(r)na c A,(A5), 

where Ae(v4, B) denotes the e-pseudo spectrum for the pencil A — zB, i.e. 

A,{A, B) = {z e C : \\iA~zB)-^\\ > e'^}. 

Furthermore, iflA is a hounded connected component of A^{A, B) such that U C fie, 
thenU contains exactly the same number of eigenvalues ofT and of the pencil {A, B). 
Proof. This is a special case of Theorem |4.4| □ 

The pseudospectral bound is simple, but computing the pseudospectra of a pencil 
may be expensive. Consequently, we may be better served by Gershgorin bounds. 
Theorem 5.2. Suppose 

T{z) = D - zl + E{z) 

where D G C"X" jg diagonal and _B ; — > C"^" is analytic. Suppose also that the 
absolute row and column sums of E are uniformly bounded, i.e. Vz £ Q, 

n n 

hj{z)\ < r„ \eijiz)\ < Cj. 

Then for any < a < 1, the eigenvalues of T lie in IJ"^^ '^^^'"'^ ihe Gi are 
generalized Gershgorin disks 

G, = {z e C : |z - du\ < p^}, P^ = rfc]-''. 

Furthermore, iflA is a union of k disks which are disjoint from the remaining disks, 
and iflA C ri, then lA contains exactly k eigenvalues. 
Proof. This is a direct corollary of Theorem |3.1[ □ 

Like the ordinary Gershgorin theorem, the nonlinear Gershgorin theorems |3 . 1 1 and 
the corollary |5.2| are particularly powerful in combination with an appropriate change 
of basis. As an example, we have the following nonlinear version of a well-known 
corollary of a theorem due to Bauer and Fike 01 Theorem IV]: 

Theorem 5.3. Suppose 

T{z) = A - zl + E{z), 

where A E C"^" has a complete basis of eigenvectors V E and i? : — > C"^" 

is analytic. Suppose also that \E(z)\ < F componentwise for all z G f2. Then the 
eigenvalues of T in lie in the union of disks 

n 

y {z e C : |z - A,| < 0J , = n\\F\\2 sec{0,), 
1=1 
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where (Xi,Wi,Vi) are eigentriples of A and 9i is the angle between the left and right 
eigenvectors Wi and Vi. If U is a union of any k of these disks that are disjoint from 
the remaining disks, and ifU C thenlA contains exactly k eigenvalues of T . 



Proof. The proof follows by applying Theorem 5.2 to V^^T{z)V and bounding 
the row sums of |y~"'^i?(z)y|. Without loss of generality, assume the columns of V 
are normalized to unit Euclidean length. For any z e $7, note that the absolute row 
sum ri{z) of V^^EV is bounded by 

<eJ\V-^\F\V\e, 

where € E" is the ith standard basis vector and e G K" is the vector of all ones. 
Let w* — e*V~^ be the ith. left eigenvector, and note that the normalization of V 
implies that || |V^|e||2 < n. Therefore, 

|r.(z)|<||ef|l/-i|||2||i^||2|||V^|e||2<||i«dl2||i^||2n. 
Note that w*Vi — 1 and — 1 by the normalization conditions, so 

Wmh = — r~r-\ — = sec(6'0. 

Therefore, we have the uniform bound 

\r,{z)\< (j), = n\\F\\2 sec{e^). 

□ 

In the next section, we give several additional examples that involve Gershgorin 
theory together with a change of basis. 

6. Applications. We now apply the results of this paper to three problems. The 
first two are nonlinear eigenvalue problems from the NLEVP collection [4^ : a problem 
due to Hadeler and a problem from the analysis of delay differential equations. For 
each of these problems, we compare the original nonlinear function to two other 
functions in order to localize the spectrum over different parts of the complex plane. 
The third example concerns resonances for one-dimensional quantum scattering from 
a piecewise-constant potential |37L [5]. characterized as nonlinear eigenvalues of a small 
matrix. After approximating the resonances via a generalized eigenvalue problem, we 
use Theorem |4.4| to check that we have computed all the resonances within a contour. 



6.1. Hadeler. The Hadeler problem in the NLEVP collection has the form 

T{z) = B(cxp(z) - 1) + Az^ - al 
where A,Bg M*^* are real and symmetric positive definite. The spectrum consists 



of sixteen simple real eigenvalues and infinitely many complex eigenvalues. Figure 6.1 
shows the spectrum and pseudospectra over part of the complex plane. As is evi- 
dent from the figure, the spectrum is highly structured away from the real line, with 
regularly-spaced clusters of eight eigenvalues near certain curves. We use Theorem |3.1| 
to compare T(z) to two simpler problems in order to localize both the eigenvalues close 
to the real line and those that are farther away. 
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Fig. 6.1. Spectrum and pseudospectra for the Hadeler problem 

6.1.1. Comparison to a simplified function. For \z\ ^ 0, T{z) is dominated 
by either B e^p{z) or Az'^. The remaining constant term becomes relatively insignif- 
icant. Therefore, our first set of bounds comes from comparing the Hadeler function 
to a simplified version without the constant term. 

Let U he a matrix of A-orthonormal eigenvectors for the pencil (B, A), and define 

f{z) = U^T{z)U = Db exp(z) + Iz^ + E 

where Db is the diagonal matrix of eigenvalues /3i > /32 > • • • > /^s > and E = 
— U'^{al + B)U is sl constant matrix. We will compare T to the simplified function 

f{z) = Db exp(z) + Iz^ = diag , /,(z) = (3, exp(z) + , 

for which the spectrum is 

A(f ) = {zeC: z^ exp(-z) = 

Though eigenvalues of T are not available in closed form, we can reliably compute 
them. Write z in both polar and Cartesian form as z = pexjp{i9) — x + iy; then 

2^ exp(-z) = p^e'"^ exp{i{29 - y)). 

Therefore, the pre-image of the negative real axis corresponds to the solutions to 
y = 20 + (2m — 1)tt, which gives the contours 

Zm{0) = y(^'^ + ?j ^{29+ (2m - l)7r)(cot(6l) + i), 

where < 9 < tt. For m 7^ 0, Zra{9)'^ exp{— Zmi9)) is monotone in 9 over < 6* < tt; for 
m = 0, z„i{9)'^ exp{~ z,ni9)) is monotone on < < tt/2 and tt/2 < < tt. Thus, one 
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Fig. 6.2. Gershgorin region (solid line) containing eigenvalues of the Hadeler function T (dots) 
and the simplified problem T (stars). Each connected component contains the same number of 
eigenvalues for both problems. The Gersgorin region is somewhat complicated, but it can be shown 
to lie within a simpler region consisting of one large disk of radius about 16.2 around the origin and 
much smaller disks around the eigenvalues ofT. 



can reliably compute the eigenvalues of T for each m by applying a bisection-guarded 
solver to 2^(0)^ exp(— Zm(6')) = —(3j. 
Using Theorem |3.1[ we know that 



A{f)c\jG]^\J{z:\f,{z)\<p,}, 



fc=i 



Furthermore, any connected comp onen t of this region contains the same number of 



eigenvalues of T and T. In Figure 6.2 we show a plot of the Gershgorin regions in 



the complex plane, with the location of the eigenvalues of T marked by asterisks. 



The Gershgorin regions in Figure 6.2 are somewhat complicated, but we can 
bound them in simpler sets. Each region Gj is contained in a narrow region 



{zeC:\^,e'-p'\<p,}DG]-- 

that is transected by the contours Zm{0) 
these contours, we have 7 = exp(— z) G 



\m\ = 



{zeC: \f3j exp{z) + z^\ < pj} 

- {29 + 2rmr){cot{e) + i), m ^ 0. Along 
, so 

>l=l=; 



7 

hence, outside a ball of radius the Gershgorin region cannot intersect these 

contours. Thus, sufficiently far from the origin, each connected component of 

contains exactly one eigenvalue A of T. Moreover, each of these connected components 
of Gj lies within a disk of radius 0(|A|~^), as we now show. 
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If fj{X) — 0, then for any w, 

fj{X + w) = -AV(u;) + A(2 - X)w + 

where r{w) — exp(u') — 1 — w is the error in a Taylor approximation to exp(w). For 
< 0.5, we can bound < ^/e/2\w\'^^, from here, we have the lower bound 

\f,{X + w)\>\X{2-X)w\-\X\{w)\~\w^\ 

>\X\{\X\~2)\w\-\X\M^)\~\w\' 

> |A|(|A| - 2)|u;| - \X\^(V^/2\w\') - \w\' = cb{H, \M) 

where 

</.(r;|A|)^|A|(|A|-2)r-(l + y^|A|V2y. 

Solving the quadratic equation ^(0.5; R) — max^ pj, we find 0(0.5; |A|) > max^ pj for 
|A| > i? « 16.2. Therefore, for any |A| > R, the quadratic equation (j){r; |A|) — pj has 
real roots < r_ < 0.5 < r+, and \fj{z)\ > (j){\z — A|; |A) > pj in the annular region 
{z : — A| e (r_,r+)}. Thus, the component of Gj containing A lies inside a ball of 
radius r_ around A. Because r_r+ = Pj/{1 + y/e/2\X\'^), we have 

r_ < 2pj/{l + V^J2\X\^) < 2mpj\X\-^ < 0.5; 

thus, 

G] C U : \z - X\ < 2mpj\X\-^^ U {z : \z\ < R} . 

f,{X)=0,\\\>R 

6.1.2. Comparison to a polynomial problem. We now consider the problem 
of localizing eigenvalues for the Hadeler example near the real axis. To do this, we 
approximate the Hadeler function 

T{z) = S(exp(z) -l)+Az-aI 

by a polynomial 

P{z) = Bq{z) + Az - al 

where q{z) is the polynomial interpolating exp(2;) — 1 through a Chebyshev grid on 
some interval [zmin , -Zmax] guaranteed to contain all the eigenvalues, which we obtain 
using the Gershgorin bounds from the previous section. 

Suppose we write P{z) = Q(x) where z = (1 — a;)0„iin/2 + (1 + a;)zinax/2; that 
is, Q is a rescaled version of P. If we expand Q in terms of first-kind Chebyshev 
polynomials Tj as 

n 

Q{x) = Y,A,T,{x), 

j=o 

then det{A^^^Q{x)) = det(G — x/), where G is the colleague ynairix linearization [10] : 



A-^Ao A-^Ai ... A-^A^^i 



C 



21 

1 

/ 



/ 
/ 
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Note that if A is an eigenvalue of T, then it corresponds (after an appropriate rescaUng 
of variables) to an eigenvalue of 



C 



21 

1 I 
I 



/ 
/ 



A-^An-i^ 



where Aq - = (exp(A) - 1 - q{X)) B. 

By first balancing and then computing an eigendecomposition, we find S such 

that 

S-^CS = Dc. 

Furthermore, any eigenvalue A for the fully nonlinear problem is an eigenvalue of 

S-^CS = Dc+ r{X)S-^EoS, 

where r(A) = exp(A) — 1 — q(A) is the error in the Chebyshev approximation and Eq 
is a block matrix with A^^B/2 in the (n, 1) block and zeros elsewhere. Therefore, 
for any e > 0, the eigenvalues inside the region where \r{z)\ < e lie in the union of 
Gershgorin disks of radius epj about the eigenvalues of C, where pj are the absolute 
row or column sums of S^^EqS. The standard theory for convergence of Chebyshev 
approximations tells us that for an appropriate range of e values, \r(z)\ < e for z in a 
Bernstein ellipse whose radius depends on e; see [3TJ Chapter 8]. 

If we apply the above procedure with a degree 20 interpolant on the interval 
from Zniin = —7.7650 to Zmax — 3.3149, we obtain a polynomial eigenvalue problem 
whose eigenvalues are shown in Figure [6?3l The polynomial provides good estimates 
for the real eigenvalues, and reasonable estimates for the first clusters of complex 
eigenvalues near the real axis. The other eigenvalues of the polynomial interpolant 
do not approximate any eigenvalues of T. 



For this problem, the largest Gershgorin radius epj is less than 7e. Figure 6.4 
shows the region where \r{z)\ < e — 10^^°; the corresponding Gershgorin disks in 
this case are so tight that they are not visible in the figure. Thus, we can trust these 
approximations to the real eigenvalues to an absolute error of less than 1G~^. 

A more interesting bound involves the eigenvalues farther from the real axis. 
Based purely on the Chebyshev approximation to the Hadeler problem, it would not 
be entirely clear whether the cluster of eigenvalues with imaginary part near 6 is 
spurious or real. If we set e 0.1 and e = 1.6, we get the Gershgorin disks shown in 
Figure [6?5] these are sufhcient to show that the polynomial eigenvalue clusters closest 
to the real line also approximate eigenvalues of T, and to bound the approximation 
error. 

6.2. Time delay. Another example from the NLEVP toolbox is the time.delay 
example, which comes from applying transform methods to a delay differential equa- 
tion. The function is 



-zl + Ao + Ai exp(-z), 
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where Aq is a companion matrix and Ai is rank one. The spectrum and pseudospectra 



for this problem over part of the complex plane are shown in Figure 6.6 

As with the Hadeler example, we can get good estimates of the eigenvalues far 
from the origin by dropping the constant term in the problem. In order to analyze this 
case, let us transform T into a convenient basis. We choose a basis of eigenvectors V 
for Ai so that V^^AiV = Di = diag(/ii, 0, 0) and so that the trailing 2-by-2 submatrix 
oi E = V^^AqV is diagonal. The eigenvalues of T are thus also eigenvalues of 

f{z) = -zl + Di exp(-z) + E; 

and, as in the case of the Hadeler example, we can easily compute the eigenvalues of 
the related problem 

f{z) = -z/ + DiCxp(-z). 

The function T has a double eigenvalue at the origin corresponding to the zero eigen- 
values of Ai ; the remaining eigenvalues are solutions of the equation 

2;exp(z) = /ii « —13.3519, 

which must lie along contours of the form 

Zm{0) = (-61 + (2m - l)7r)(cot(0) + i). 

Eigenvalues of T must lie in the (column) Gershgorin region 

3 

|J{z : I - z + iijexp{-z)\ < pj}, 
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6 



-10 



-10 -8 -6 -4 -2 2 4 6 8 10 



Fig. 6.4. Region where the interpolation error r(z) for the Chebyshev approximation to exp{z) — 
1 is bounded by e = 10"'^''. The Gershgorin disks in this case are all distinct, and all have radii less 
than 10-^. 




Fig. 6.5. Region where the interpolation error r{z) for the Chebyshev approximation to exp{z) — 
1 is bounded by e = 0.1 (left) and e = 1.6 (right). The Gershgorin disks of radii epj are shown in 
green in each case. 



where pj are the absolute column sums of E. In Figure 6.6 we plot this region in the 
com plex plane, with the location of the eigenvalues of T marked by asterisks. Theo- 



3.1 



gives us that each component contains the same number of eigenvalues of T 
and T. Note in particular that this means that the central "blob" in the pseudospec- 
trum must contain exactly six eigenvalues of T, as indeed it does - two eigenvalues 
closer to the origin, and a pair of degenerate double eigenvalues at ±9.4248i. 



The Gershgorin regions shown in Figure 6.7 obtained by comparing T to T extend 



far into the right half plane. We can tighten our inclusion region somewhat by also 
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Fig. 6.6. Spectrum (in dots) and pseudo spectra for the time delay example. 



comparing T to ~ zl and letting Ai exp(— z) be the error term. Define 

f (z) = V-'^T{z)V = Da-zl + ^exp(-z) 

where = V^^AqV and E — V^^AiV. Applying the nonlinear Gershgorin argu- 
ment, any eigenvalue of T must live in the union of the regions \di — z\ < 7i| exp(— z)|, 
where 7^ is an absolute row sum of E. This region is bounded from the right by the 



contour shown in Figure 6.8 Intersecting these bounds with the previous Gershgorin 
bounds give very tight control on the spectrum. 

6.3. Resonances. Our final example is a problem associated with resonances of 
a Schrodinger operator on the positive real line fST, 'F . We seek the values of A such 
that the following two-point boundary value problem has nontrivial solutions: 

+ (6.1) 

V'(O) = and tp'ih) = iy/Xi>{b), 

where V equals Vq > on (a, b) and is zero elsewhere. In our computations, we 



used (a, 6) = (2,3) and Vq = 5. We formulate (6.1) as a finite-dimensional nonlinear 



eigenvalue problem by shooting from to a and from a to 6 [H Chapter 7], described 
as follows. 



Rewriting (6.1) in first-order form, we have 
du 



dx 



Then the matrices 

^Oa(A) = exp 





V -\ 



u, where u{x) 



ipix) 



(6.2) 



RabW =cxp (6 -a) 





Vo-X 
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Fig. 6.7. Spectrum of the approximation f (stars) to the time delay problem T, Gershgorin 
regions (thick line), and pseudospectra for T . 



respectively map m(0) u{a) and u{a) i— >■ u{b). Thus, (6.1) is equivalent to the 
six-dimensional nonlinear eigenvalue problem 



T(A)Uall = 



i?Oa(A) -/ 
Rab{X) 
1 0" 













-/ 





u{0) 
u{a) 
u{b) 



= 0. 



(6.3) 



In the next section, we derive a rational approximation T ^ T whose linearization 



K — AM corresponds to a discretization of (6.1 1. We then use the eigenvalues of T as 
starting points to compute eigenvalues of T, and establish by Theorem |4.4| that this 
procedure finds all eigenvalues of T in a region of interest. 

6.3.1. Rational approximation of the resonance problem. We construct 
K — AM by introducing auxiliary variables y whose elimination produces a rational 
approximation to r(A). That is, we write A{X) = K — AM so that 



"All (A) 






Wall 




T(A)Man 


_A2l(A) 


A22(A) 




. y . 








(6.4) 



If we eliminate the auxiliary variables and the equations that define them, we are left 
with a rational approximation to T(A) given by the leading 1-by-l Schur complement 
in A: 



r(A) « r(A) An(A) - Ai2(A)A22(A)-iA2i(A). 
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Fig. 6.8. Gershgorin region for a simplified problem with the exponential term dropped (left of 
the dashed line). This is superimposed on the Gershgorin regions from Figure \6. 7| 




Fig. 6.9. Linearized rational approximation to ( |6.3| l. The rational eigenvalue problem is a 
Schur complement in a linear eigenvalue problem obtained by eliminating all but the first six variables 
and equations. The overall matrix is assembled from linear matrix-valued functions A^°'{X) (cross- 
hatched), A°'^(X) (plaid), and (X) (dots) that generate rational approximations to Roai^) , Rabi^), 
and —i\/~\, respectively. 
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More precisely, we will define Aii{X) to be the constant part of r(A), i.e. 






-/ 











-/ 


1 0' 











1 



^ii(A) 



then add three submatrices A'^"(A), A"'' (A), and (X) (to be defined in a moment) 
that generate rational approximations to the nonlinear terms i?oa(A), Rab{X), and 
—i^fX when the Schur complement in A is taken. The structure of the matrix A in 
terms of these submatrices is shown schematically in Figure |6.9[ 

To define the rational approximation to i?oa(A), we start by writing the exact 
function i?oa(A)M(0) via the equations 







- 



B{a) ■ 
B(0) 

- A 



4> 



RoaiX)u{0) 





(6.5) 



where 



If we discretize (6.5 1 by replacing ^ with a vector of function values at sample 



points, and correspondingly replace the operators in the second column in (6.5) with 
discrete approximations, we are left with a matrix equation 



yl(°")(A) 







K 



B{a) 

m 

XM. 



H 



H 



MO) 



RoaiXHO) 





(6.6) 



where Kh and Mh are some fixed matrices of dimension {N — 2) x N. For our 
problem, we set to be function values at a Chebyshev mesh of TV = 40 points 
on [0, a], and (Kj^ — XM^) represents a pseudospectral collocation discretization of 
-d^jdx^ - A [SO]- The matrix A^^iX) is defined similarly. 

To define A^ , we begin with the best max-norm rational approximation to z^^l"^ 
on an interval [m, Af], which was first discovered in 1877 by Zolotarev ^16i §5.9]. The 
approximation is 

z-^/^ « r{z) = £ 



2-0 



where the poles and the weights 7^ are defined in terms of elliptic integrals; for 
details, we refer to Method 3 of jT5]. We approximate —iy/X by ~i/r{X), which we 
encode as the leading 1-by-l Schur complement in 



A^(A) 



1 



7i 

a-A 



72 



6- A 



Nz 



X 



For our problem, we use the Zolatorev approximation with Nz 
optimality on the interval [m,M] — [0.1,500]. 



20 poles, chosen for 
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-50 50 100 150 200 250 300 350 400 -10 -S -6 -4 -2 2 4 6 8 10 



Fig. 6.10. Computed eigenvalues for Vo = 5, r = 2, R = 3, using 20 poles in a Zolotarev square 
root approximation optimal on [0.1,500], and Chebyshev meshes of size 40. Circled eigenvalues 
satisfy ||T'(A)|| > 10~*. Contour plots of logj^Q{\\T{z) — T{z)\\) and an ellipse on which the smallest 
singular value of T{z) is greater than 10~* (l^ft)- A closer view (right). 



6.4. Analysis of the rational approximation. Our goal in this section will be 
to find all eigenvalues in the region D bounded by the ellipse T shown in Figure |6.10[ 
D is clearly contained in fig, where e = 10^^. Moreover, F was chosen so that 



< e for all z G F. This means that the contour F does not intersect the 



e-pseudospectrum of T, and hence any connected compone nt of (T) in D contains 
the same number of eigenvalues of T and T (by Theorem 4.4). It follows that the 
same number of eigenvalues of T and T lie in D. 

Since the norm of the perturbation is small there, we expect that the eigenvalues 
of T in are very good approximations to those of T. We refine each these eigenvalue 
estimates by Newton iteration on the test function g{z) defined by a bordered system 



-T{z) b 




V 




■q" 











1 



see [131 Chapter 3]. Indeed, each iteration has a distinct limit with a residual error 
on the order of machine precision, thus showing that we have obtained all the eigen- 
values of T within the ellipse. The difference between each eigenvalue of T and the 
corresponding eigenvalue of T is shown in Table |6.1| 



7. Conclusion. In this paper, we have described several localization theorems 
for the spectrum of a regular analytic function T : 17 — t' C"^". These pseudospectral 
and Gershgorin inclusion results generalize well-known perturbation theory for the 
standard eigenvalue problem. We have also shown through several examples how 
these results may be used as a practical tool to localize the nonlinear spectrum, 
count eigenvalues in parts of the complex plane, and judge which eigenvalues from 
an approximating eigenvalue problem are accurate approximations of true eigenvalues 
and which arc spurious. 
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